Measuring Human-perceived Similarity in Heterogeneous Collections
نویسندگان
چکیده
We present a technique for estimating the similarity between objects such as movies or foods whose proper representation depends on human perception. Our technique combines a modest number of human similarity assessments to infer a pairwise similarity function between the objects. This similarity function captures some human notion of similarity which may be difficult or impossible to automatically extract, such as which movie from a collection would be a better substitute when the desired one is unavailable. In contrast to prior techniques, our method does not assume that all similarity questions on the collection can be answered or that all users perceive similarity in the same way. When combined with a user model, we find how each assessor’s tastes vary, affecting their perception of similarity.
منابع مشابه
Automatic keyword extraction using Latent Dirichlet Allocation topic modeling: Similarity with golden standard and users' evaluation
Purpose: This study investigates the automatic keyword extraction from the table of contents of Persian e-books in the field of science using LDA topic modeling, evaluating their similarity with golden standard, and users' viewpoints of the model keywords. Methodology: This is a mixed text-mining research in which LDA topic modeling is used to extract keywords from the table of contents of sci...
متن کاملMeasuring Concept Similarity of Heterogeneous Ontologies in Multi-angent System
Different kinds of agents in a multi-agent system have different knowledge structure, which results in difficulties of interaction and coordination among agents. At present, ontology based knowledge representation is an effective way of resolving such difficulties, the key of which is heterogeneous ontology concept similarity measuring. In this paper, we designed an integrated heterogeneous ont...
متن کاملHighly Heterogeneous XML Collections: How to Retrieve Precise Results?
Highly heterogeneous XML collections are thematic collections exploiting different structures: the parent-child or ancestor-descendant relationships are not preserved and vocabulary discrepancies in the element names can occur. In this setting current approaches return answers with low precision. By means of similarity measures and semantic inverted indices we present an approach for improving ...
متن کاملA framework for comparing heterogeneous objects: on the similarity measurements for fuzzy, numerical and categorical attributes
Real-world data collections are often heterogeneous (represented by a set of mixed attributes data types: numerical, categorical and fuzzy); since most available similarity measures can only be applied to one type of data, it becomes essential to construct an appropriate similarity measure for comparing such complex data. In this paper, a framework of new and unified similarity measures is prop...
متن کاملSoft Computing A Framework for Comparing Heterogeneous Objects: on the Similarity Measurements for Fuzzy, Numerical and Categorical Attributes
Real-world data collections are often heterogeneous (represented by a set of mixed attributes data types: numerical, categorical and fuzzy); since most available similarity measures can only be applied to one type of data, it becomes essential to construct an appropriate similarity measure for comparing such complex. In this paper, a framework of new and unified similarity measures is proposed ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- CoRR
دوره abs/1802.05929 شماره
صفحات -
تاریخ انتشار 2018